Transcriptional regulation of the human beta3-adrenergic receptor gene

ABSTRACT

Regulatory elements responsible for tissue-specific transcriptional regulation of the human β 3 -adrenergic receptor (β 3 -AR) were identified. A region localized between −6.50 and −6.30 kb of the proximal promoter contained three sequences that act synergistically to achieve full transcriptional activity. One segment, termed segment A, contains an Sp1 binding site. Another of the sequences, termed segment B, is a binding site for a trans-acting factor present in cells that constitutively express β 3 -AR. In a specific embodiment, the trans-acting factor is expressed in neuroblastoma (SK-N-MC) and brown adipose tissue cells, but little or not at all in CV-1, HeLa, or white adipose tissue cells. The third segment, C, is an S1 nuclease-sensitive site having CCTT repeats. Recombinant vectors under control of this transcriptional regulation region, particularly containing the B and C segments, provide a substrate for high throughput assays, such as reporter gene assays, to identify compounds that can increase the level of expression of β 3 -AR. The B segment nucleic acids also provide for isolation and cloning of the trans-acting factor. Mechanisms of transcriptional regulation and identification of other adjacent proteins involved in the regulation of the hβ 3 -AR gene expression are provided.

[0001] This is a division of application Ser. No. 09/761,116, filed Jan. 16, 2001, which is a division of U.S. Pat. No. 6,197,580, issued on Mar, 6, 2001, each of which is herein incorporated by reference in their entirety.

FIELD OF THE INVENTION

[0002] The present invention relates to a positive cis-regulatory (enhancer) element of human β₃-adrenergic receptor responsible for its transcription in SK-N-MC cells. Such element is composed of three DNA binding sites that act synergistically and is located 6.5 kb upstream from the translation start site of the β₃-adrenergic receptor. The invention further relates to use of this enhancer element for regulated gene expression and for drug screening.

BACKGROUND OF THE INVENTION

[0003] The β₃-adrenergic receptor (β₃-AR) is an important regulator of metabolic activity in brown and white adipose tissue, two major sites for regulation of energy balance. The β₃-AR belongs to a family of G-protein coupled receptors. Its binding to endogenous ligand or specific synthetic agonist leads to activation of adenylate cyclase, an increased concentration of cAMP, and an increased activity of PKA, resulting in increased thermogenic activity and heat production in brown adipose tissue (BAT) and lipolysis in white adipose tissue (WAT). Since β₃-AR stimulation causes an increase in thermogenic activity and a less efficient utilization of metabolic fuels, its sustained activation should be of benefit in the treatment of obesity, and improvement of glycemic control in type II diabetes. Indeed, numerous reports have shown that stimulation of β₃-ARs cause weight loss and improvement in glycemic control in rodent models of these diseases (Carroll et al., Diabetes 34:1198-1204, 1984; Umekawa et al., European Journal of Endo.136:429-437, 1997; Yoshida et al., J. Nutr. Sci. Vitaminol (Tokyo) 36:75-80, 1990; Yoshida et al., Endocrinol Japon 38:397-403, 1991; Smith et al., New Antidiabetic Drugs, Ed.: Bailey, C. J., and Flatt, P. R., London, 1990; Largis et al., Drug Dev. Res. 32:69-76, 1994; Bloom et al., Journal of Medicinal Chemistry 35:3081-3084, 1992; Cawthorne et al., American Journal of Clinical Nutrition 55:252S-257S, 1992).

[0004] Despite the well characterized role of β₃-ARs in rodents, its role in the regulation of energy balance in man is still not clear. Observations by several groups of investigators (Walston et al., New Engl. J. Med. 333:343-347, 1995; Kadowaki et al., Biochem. Biophys. Res. Commun. 215:555-560, 1995; Yoshioka et al., Diabetologia 39: 1410-1411, 1996; Fujisawa et al., Diabetologia 39:349-352, 1996; Kurabayashi et al., Diabetes 45:1358-1363, 1996; Zhang et al., Diabetologia 39:1505-1511, 1996; Widen et al., New Engl. J. Med. 333:348-351, 1995; Clement et al., New Engl. J. Med. 333:352-354, 1995) describing the existence of a positive correlation between an Arg to Trp mutation at position 64 in the human β₃-AR (hβ₃-AR ) gene and the early onset of non-insulin dependent diabetes mellitus (NIDDM) (Walston et al., supra; Yoshioka et al., supra; Fujisawa et al., supra; Kurabayashi et al., supra; Widen et al., supra), insulin resistance (Walston et al., supra), increased weight gain (Kadowaki et al., supra; Yoshioka et al. supra; Fujisawa et al. supra; Zhang et al, supra; Clement et al., supra) and abdominal obesity (Walston et al., supra) seemed to validate the role of the β₃-AR in the development of obesity and diabetes. However, other investigators have failed to observe any correlation between the prevalences of the Trp64Arg mutation and obesity or diabetes in the patient populations they have studied (Li, et al., Diabetologia 39:857-860, 1996; Elbein et al., J. Clin. Endocrinol. Metab. 81:4422-4427, 1996; Ueda et al., Metabolism 46:199-202, 1997; Awata et al. Diabetes Care 19:271-272, 1996). Further, the pharmacology and biology of the wild type and mutated receptors did not differ when compared in cell based systems (Candelore et al., Endocrinology 137:2638-2641, 1996; Pietri-Rouxel et al., Eur.J. Biochem. 247:1174-11791 1997), although it does appear that the mutations cause less accumulation of cAMP, indicating diminished signal transduction efficiency in the cells that contained mutated hβ₃-AR. The physiological significance of this observation is not yet clear. The success achieved in the treatment of obesity and diabetes in rodent models with selective β₃-AR agonists supported a role for this receptor as a therapeutic target and has prompted a great effort to develop compounds with a high affinity and selectivity towards the human β₃-AR.

[0005] Despite the approximately 80% homology in amino acid sequences and similar adipose tissue expression pattern, human and mouse β₃-ARs differ in two very important ways. First, the mouse β₃-AR shows a high affinity and selectivity for certain β₃-AR specific agonists, while the human receptor has little or no affinity for the same agonists when tested in CHO cells stably transfected with human β₃AR (Liggett et al., Molecular Pharmacology, 42:634-637, 1992). On the other hand, agonists that show high affinity and selectivity in β₃-AR transfected CHO cells that express a high level of human β₃-ARs have little or no activity in vivo (when tested in non human primates and clinical trials). Their lack of robust activity may be due to pharmacokinetic or metabolic issues, but another reason for this discrepancy may be the low level of β₃-AR expression in target tissues (Wilson et al., The Journal of Pharmacology and Experimental Therapeutics 279:214-221, 1996).

[0006] Second, in rodents β₃-AR mRNA is predominantly detected in brown and white adipose tissue (Granneman et al., Mol. Pharmacol. 40:895-899, 1991; Muzzin et al, J. Biol Chem. 266:24053-24058, 1991), while in man β₃-ARs are expressed in BAT but appear to have little or no expression in WAT (Krief et al., J. Clin. Invest. 91:344-349, 1993; Thomas and Liggett, Mol. Pharmacol., 43:343-343, 1991). However, other studies support the existence of functionally active β₃-ARs in human WAT, showing an increase in lipolysis and glycerol formation after treatment with CGP-12177 (a selective agonist for the human β₃-AR) both in vivo (Lonnqvist, Br. J. Pharmacol., 1993) and in vitro (Enocksson et al., J. Clin. Invest. 95:2239-2245, 1995). The significance of these studies in terms of the importance of β₃-ARs in the regulation of WAT physiology remains to be proven, since CGP-12177 might interact with other as yet undescribed adrenergic receptors (Kaumann, A. J., Trends in Pharmacological Sciences 18:70-76, 1997).

[0007] The low affinity for synthetic agonists together with a low level of expression of hβ₃-ARs in WAT and the relatively small amount of BAT in humans may explain why agonists developed so far have been inactive in man.

[0008] Although the beneficial effects of an increased presence of β₃-ARs in man can be envisioned, research leading to a better understanding of the factors that regulate the hβ₃-AR gene expression has been limited. Given the possibility that increased expression of β₃-AR in humans will lead to an increased accessibility of receptors in WAT and BAT, as well as to the recruitment of more brown adipocytes, there is a need in the art to better understand mechanisms and factors that regulate transcription of hβ₃-ARs.

[0009] This problem is rendered more complex due to the lack of human brown and white adipose tissue cell lines. Although the genes for mouse (Nahmias et al., EMBO J. 10:3721-3727, 1991), rat (Granneman et al., Mol. Pharmacol. 40:895-899, 1991), and human β₃-ARs (Granneman et al., Mol. Pharmacol. 44:264-270, 1993); Granneman et al., Mol. Pharmacol. 42:964-970, 1992; Emorine et al., Science 245:1118-1121) have been cloned, little additional data is available about the structure of regulatory regions and possible transcription factors close to the proximal promoter that may play a role in β₃-AR transcriptional regulation (Liggett et al., J. DNA Sequencing and Mapping 2:61-63, 1991). Thus, there is a need in the art to provide the endogenous regulatory sequences of β₃-AR.

SUMMARY OF THE INVENTION

[0010] In one aspect, the present invention provides an isolated nucleic acid comprising a nucleotide sequence that is greater than 80% identical to the nucleotide sequence GCCTCTGGGGAG (SEQ ID NO:1). This sequence is specifically recognized by a transcription factor, and is responsible for tissue specific expression of β₃-AR. The invention further provides this nucleic acid further comprising a nucleotide sequence that binds an Sp-1 transcription factor protein, or contains an S1 nuclease sensitive site, or both. A combination of these sequences in proximity to each other in a nucleic acid, termed herein a β₃-AR enhancer element, permits positive regulation of expression of a gene operably associated with the sequences. Host cells containing vectors, with genes operably associated with the enhancer element, and both transient and stable expression of such genes, are also provided.

[0011] The invention further provides a specific β₃-AR trans-activating factor polypeptide having the following characteristics: (a) it binds specifically to the nucleic acid having the sequence GCCTCTGGGGAG (SEQ ID NO:1); (b) it is expressed by mouse brown adipose tissue cells; (c) it is expressed at very low levels by human white adipocytes isolated from the perirenal depot; (d) an AP-2 binding oligonucleotide does not compete with a nucleic acid having the nucleotide sequence GCCTCTGGGGAG (SEQ ID NO:1) for binding the polypeptide; and (e) when complexed to a nucleic acid comprising SEQ ID NO:1, it is not recognized by an antibody to AP-2, e.g., as detected in a super shift assay.

[0012] In another aspect, the invention provides a method of isolating a polypeptide that binds specifically to a nucleic acid having a nucleotide sequence GCCTCTGGGGAG (SEQ ID NO:1). The method comprises contacting a composition suspected of containing the polypeptide with the nucleic acid under conditions that permit detection of binding of the polypeptide to the nucleic acid; and isolating the bound polypeptide.

[0013] In still another embodiment, the invention provides a method of screening for a compound that increases activity of a β₃-AR trans-activating factor in human cells. This method comprises contacting cells with a test compound; and detecting an increase in a level of activity of the β₃-AR trans-activating factor.

[0014] A related method of the invention provides for screening for a compound that inhibits activity of a β3-AR trans-activating factor in human cells. Such a method comprises contacting cells with a test compound; and detecting a decrease in a level of activity of the β₃-AR trans-activating factor.

[0015] In both screening methods, the test cells are capable of producing, i.e., expressing, one or more components of the β₃-AR trans-activating factor.

[0016] These and other aspects of the present invention are further elaborated in the drawings and detailed description that follow.

BRIEF DESCRIPTION OF THE DRAWINGS

[0017]FIG. 1: Partial restriction enzymes map of 7A human β₃-AR clone. A human lung genomic library was screened with a probe that corresponds to the full cDNA region except the last 6 amino acids. The probe was made by ligation of 4 PCR products that overlap partial cDNA for the hβ₃-AR gene. The map shows restriction enzymes sites and their corresponding position within the genomic DNA from the translation start site (PsI-Pst I, Avr-AvII, EV-EcoRV, E1- EcoRI, BE-BstEII, SI-SalI).

[0018]FIG. 2: Determination of transcription start site of hβ₃-AR gene using 5′ RACE. 5′ RACE was performed on poly A RNA isolated from SK-N-MC cells using the Marathon cDNA amplification kit (Clontech). Twenty subcloned RACE-PCR products were subcloned and sequenced. Capital letters in the Figure represent translation start sites. The underlined sequence represents the primer used in 5′ RACE; determined transcription start sites are indicated with asterisks.

[0019]FIG. 3: Partial restriction enzymes map of constructs used in transient transfection experiments. All promoter regions were cloned in a pGL3 basic vector. The positions of the restriction enzyme sites are based on assigning the translation start site as +1.

[0020]FIG. 4: Constructs to evaluate the effect of far upstream region on transcription activity. Transfection constructs were made by keeping region between −7 to −5.6 and deleting regions between Avr II and EcorV, EcoRI and BstEII to make dEVhβ3AR/luc, dEIhβ3AR/luc and dBhβ3AR/luc respectively.

[0021]FIG. 5: Further analysis of 1.5 kb of distal promoter. A series of PCR products were made and ligated to a TK minimal promoter within pGL3 basic.

[0022]FIGS. 6A, B, C and D: EMSA experiments with oligonucleotides generated from sequence between −6.508 and −6.308 kp from 5′ flanking region of hβ₃-AR gene. A: Sequence of 200 bp between primers “6” (−6.508 kb) and “8” (−6.308 kb). EMSA were done using underlined sequences as oligonucleotides, which are marked as 1, 2, 3, 4, and 1A to cover the 200 bp region, and 2A, 3A, 4A, and 1B representing overlaps between oligonucleotides 1 and 2, 2 and 3, 3 and 4, and 1A and 4, respectively. Experiments were done with doublestranded labeled oligonucleotides under the conditions described in experimental procedures. B and C: Nuclear extracts from SK-N-MC, CV-1 and HeLa cells were incubated with radiolabeled oligonucleotides described above. The Figure shows only oligonucleotides that show binding. B: Oligonucleotides 2, 2A, 3A, and C: 1B and 4A. D: Also, EMSAs were done in a presence of excess of indicated cold oligonucleotides. The amount of cold oligos was a 50x molar excess of labeled oligonucleotides unless otherwise indicated. Arrows indicate the position of major DNA-protein complexes.

[0023] FIGS. 7A and 7B: Mutational analysis of regions “A” and “B”. A: Segment “A” oligonucleotides (SEQ ID NOS:33-40) representing the overlap between oligonucleotides 1 and 2. B: Segment “B” oligonucleotides (SEQ ID NOS:41-48) representing the overlap between oligonucleotides 2 and 3A. Mutated oligonucleotides are shown, with mutated bases underlined. Small letters correspond to restriction enzymes sites for easier cloning and purification of doublestranded oligonucleotides.

DETAILED DESCRIPTION OF THE INVENTION

[0024] The present invention is based, in part, on the discovery of a new regulatory region that appears to be an enhancer element for the human β₃-AR gene. Mechanisms of transcriptional regulation of the human β₃-adrenergic receptor were studied using SK-N-MC cells, a human neuroblastoma cell line that expresses β₃- and β₁-adrenergic receptors endogenously. A genomic 7 kilobase (kb) region of the human β₃-adrenergic receptor 5′ flanking region was isolated. Transfection constructs that contain deletions within this 7 kb region linked to a luciferase reporter gene were made and transfected in SK-N-MC, CV1 and HeLa cells. Maximal luciferase activity was observed only with the presence of a 200 basepair (bp) region located far upstream (between −6.5 kb and −6.3 kb) from the translation start site. This region was also shown to determine cell-specific expression in SK-N-MC cells, but not CV1 or HeLa cells. Electrophoretic mobility shifts assays using nuclear extracts from SK-N-MC, CV-1 and HeLa cells with oligonucleotides that cover this 200 bp promoter region revealed the existence of three cis-regulatory elements (segments A, B, and C), or protein binding segments, that act synergistically to achieve full transcriptional activity. Mutational analysis, along with antibody-supershift studies and competition experiments, implicated an Sp1 transcription factor as a part of the positive regulatory element (segment A). Segment B represents a novel DNA binding site that binds a protein present in nuclear extracts from SK-N-MC cells and brown adipose tissue, but which was not detectable in other cells, notably white adipose tissue by methods used in the Examples, infra. These data support the brown adipose tissue-specific expression of the β₃-adrenergic receptor. Segment C binds protein present in SK-N-MC and HeLa cells; it also exhibits an S1 nuclease hypersensitive site. These data taken together indicate the existence of cell specific positive cis-regulatory elements located 6.5 kb upstream from the translation start site that play a pivotal role in transcriptional regulation of the human β₃-adrenergic receptor.

[0025] The nucleotide sequence of a nucleic acid (DNA) that binds a tissue-specific trans-activating factor has been identified in segment B of the β₃-AR regulatory region. The sequence is greater than 80% identical (at least 10 of 12 bases are the same) to the core nucleotide sequence GCCTCTGGGGAG (SEQ ID NO:1). In a specific embodiment, the sequence of this segment is GCCTCTGGGGAG (SEQ ID NO:1). In another specific embodiment, the sequence is 78% similar to an ERF AP-2 consensus binding site. However, an oligonucleotide comprising the segment B core sequence cannot displace binding of an AP-2 oligonucleotide by an AP-2 protein, even at a 100-fold molar excess. Segment B can be further characterized by its specificity for binding with a trans-activating factor found at detectable levels in brown adipose tissue (BAT) and in a human neuroblastoma cell line, but which is not detectably present in HeLa cells, or in white adipose tissue (WAT) (murine, and probably human as well). Thus, in a specific embodiment, an isolated nucleic acid comprising a nucleotide sequence of the invention can be identified by its specificity for the particular trans-activating factor.

[0026] The new trans-activating factor is termed herein the “B segment-binding trans-activating factor.” This B segment-binding trans-activating factor polypeptide appears to be an AP-2-like protein. This conclusion is based on the similarity of the B segment sequence, specifically SEQ ID NO:1, to an ERF consensus binding site that belongs to a family of AP-2 transcription factors. In a specific embodiment, infra, the B segment-binding trans-activating factor binds a nucleic acid comprising the sequence GCCTCTGGGGAG (SEQ ID NO:1) with high enough affinity that an AP-2 oligonucleotide competitor failed to displace it, even at a 200-fold molar excess. In a further embodiment, the B segment-binding trans-activating factor does not bind an antibody that recognizes AP-2.

[0027] A second protein-binding segment that operates synergistically with segment B in the regulatory region has also been identified. Because in a specific embodiment this segment is upstream (5′) to segment B, it has been termed segment A. Segment A binds an Sp1-like transcription factor. In specific embodiments, segment A is displaced from binding a protein from cellular nuclear extracts by an Sp1 oligonucleotide. In another embodiment, a protein that binds to segment A is recognized by an anti-Sp1 antibody. Thus, in a specific embodiment, segment A is an Sp1 binding site. In a further specific embodiment, exemplified infra, the nucleotide sequence of the binding site is AGGTGGGACT (SEQ ID NO:2). This binding site sequence differs from known Sp1 sequences. It contains a “GGTG” motif, whereas known Sp1 sequences have a “GGCG” motif.

[0028] A third protein-binding segment that is necessary for the positive regulatory effects of segment A and segment B, alone and in combination, has also been identified. Because in a specific embodiment exemplified infra this segment is 3′ to segment B, it has been termed segment C. Segment C is an S1 nuclease-sensitive site. In a specific embodiment, it comprises at least 12 bases, and preferably about 80 bases, of a homopurine-homopyrimidine rich region. In a more specific embodiment, segment C comprises at least 3, and preferably about 20, repeats of the sequence CCTT. In a specific embodiment exemplified infra, there are 19 repeats of CCTT and one repeat, in the 19^(th) position, with the sequence TCTT (for a total of 20 repeats). Segment C is found to be essential for transcriptional activity of segments A and B.

[0029] The tissue-specific trans-activating factor binding sequence that has been termed herein segment B, along with the Sp1 transcription activation factor binding sequence termed herein segment A and the S1 nuclease-sensitive segment that is a protein binding site termed herein segment C (collectively, regulatory segments), can be combined to create a regulatory region that positively regulates gene expression in a tissue-specific manner. The regulatory region of the invention comprising all three segments has the attributes of an enhancer element, and the term “β₃-AR enhancer element” or “enhancer element” may be used herein for the regulatory region of the invention comprising all three segments. In a specific embodiment, the enhancer element is a 200 base-pair sequence as depicted in FIG. 6A (SEQ ID NO:3).

[0030] As used herein, a “β₃-AR trans-activating factor” refers to a polypeptide or polypeptide complex that recognizes the enhancer element. This complex includes the AP-2 like B segment trans-activating factor polypeptide, Sp1, and the C segment-specific polypeptide. It also includes other binding factors that may interact with genomic DNA sequences present on the approximately 7 kb DNA upstream of the β₃-AR start site.

[0031] A nucleic acid vector, such as a plasmid, containing these sites, or combinations of A and C or B and C, in proximity to each other and upstream a distance of about 0 to about 10 kb from a promoter region operatively associated with a gene on an expression vector can increase the level of expression of the gene. Inclusion of the B segment confers tissue specificity: expression of the gene under control of the enhancer element only occurs in cells that have a transcription factor that binds to the core binding sequence of segment B. All three segments together (the enhancer element) synergistically increase expression levels of a gene operably associated with the enhancer element under appropriate expression conditions (i.e., when the B segment binding protein is present in the cell). The three segments can be oriented in either orientation in the 5′ flanking region of a gene. Thus, in one embodiment, the segments are arranged in A-B-C order on the coding strand relative to the translation start site. In another embodiment, the segments are arranged in A-B-C order on the non-coding strand (and thus in [-C]-[-B]-[-A] order on the coding strand, where [-C] is the complement of the C segment, [-B] is the complement of the B segment, and [-A] is the complement of the A segment). Preferably, to ensure proper positioning to allow interaction between binding factors, and to reveal, to the extent possible, activation and binding domains after conformational changes, the three segments are arranged in A-B-C order, with the relative spacing between each segment found, e.g., in SEQ ID NO:3.

[0032] As used herein, the term “proximity” when applied to the protein binding segments of the enhancer element means that the sites are located in a range from 1 to about 100 nucleotides from each other, and preferably, from about 10 to about 30 nucleotides from each other. In specific embodiments, segment A is located about 14 nucleotides upstream of segment B, and segment B is located about 31 nucleotides upstream of segment C.

[0033] In a specific embodiment, exemplified infra, the enhancer element is found in a nucleic acid (genomic DNA) isolated from a region about 7 kb upstream (−7 kb) of the β₃-adrenergic receptor (β₃-AR) transcription initiation (start) site. More specifically, the enhancer element can be located between −6.5 kb and −6.3 kb of the translation start site of the gene. In particular, a 200 bp region of the 5′ flanking region of β₃-AR demonstrates the enhancer activity. In other embodiments, exemplified infra, location of the enhancer adjacent to 500 bp of the translation start site of the gene, and in locations between −7 kb and −0.5 kb, results in positive regulation of gene expression. The enhancer element operates with a heterologous promoter. In a specific embodiment, the enhancer element regulates gene expression from a herpes simplex virus (HSV) thymidine kinase (tk) minimal promoter. In another embodiment, the enhancer element operates more efficiently in conjunction with a β₃-AR promoter (found −0.5 kb from translation start site).

[0034] As used herein, the term “about” or “approximately” means within 20%, preferably within 10%, and more preferably within 5% of a given value or range.

[0035] As used herein, the term “isolated” means that the referenced material is free of components found in the natural environment in which the material is normally found. In particular, isolated biological material is free of cellular components. In the case of nucleic acid molecules, an isolated nucleic acid includes a PCR product, an isolated MRNA, a cDNA, or a restriction fragment. In another embodiment, an isolated nucleic acid is preferably excised from the chromosome in which it may be found, and more preferably is no longer joined to non-regulatory, non-coding regions, or to other genes, located upstream or downstream of the gene contained by the isolated nucleic acid molecule when found in the chromosome. In yet another embodiment, the isolated nucleic acid lacks one or more introns. Isolated nucleic acid molecules can be inserted into plasmids, cosmids, artificial chromosomes, and the like. Thus, in a specific embodiment, a recombinant nucleic acid is an isolated nucleic acid. An isolated protein may be associated with other proteins or nucleic acids, or both, with which it associates in the cell, or with cellular membranes if it is a membrane-associated protein. An isolated organelle, cell, or tissue is removed from the anatomical site in which it is found in an organism. An isolated material may be, but need not be, purified.

[0036] The term “purified” as used herein refers to material that has been isolated under conditions that reduce or eliminate unrelated materials, i.e., contaminants. For example, a purified protein is preferably substantially free of other proteins or nucleic acids with which it is associated in a cell; a purified nucleic acid molecule is preferably substantially free of proteins or other unrelated nucleic acid molecules with which it can be found within a cell. As used herein, the term “substantially free” is used operationally, in the context of analytical testing of the material. Preferably, purified material substantially free of contaminants is at least 50% pure; more preferably, at least 90% pure, and more preferably still at least 99% pure. Purity can be evaluated by chromatography, gel electrophoresis, immunoassay, composition analysis, biological assay, and other methods known in the art.

Vector Constructs

[0037] The protein binding segments identified herein can be used in recombinant construction of expression vectors with positive regulation of gene expression. As noted above, the A and C or B and C segments can be used together to achieve positive regulation of gene expression. Furthermore, use of all three segments synergistically enhances gene expression. Accordingly, as used herein, use of a regulatory element of the present invention in a recombinant expression vector relates to any of the foregoing combinations, unless a specific combination is explicitly stated.

[0038] In accordance with the present invention there may be employed conventional molecular biology, microbiology, and recombinant DNA techniques within the skill of the art. Such techniques are explained fully in the literature. See, e.g., Sambrook, Fritsch & Maniatis, Molecular Cloning: A Laboratory Manual, Second Edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (herein “Sambrook et al., 1989”); DNA Cloning: A Practical Approach, Volumes I and II (D.N. Glover ed. 1985); Oligonucleotide Synthesis (M.J. Gait ed. 1984); Nucleic Acid Hybridization [B. D. Hames & S. J. Higgins eds. (1985)]; Transcription And Translation [B. D. Hames & S. J. Higgins, eds. (1984)]; Animal Cell Culture [R. I. Freshney, ed. (1986)]; Immobilized Cells And Enzymes [IRL Press, (1986)]; B. Perbal, A Practical Guide To Molecular Cloning (1984); F. M. Ausubel et al. (eds.), Current Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994).

[0039] Therefore, if appearing herein, the following terms shall have the definitions set out below.

[0040] A “vector” is a recombinant nucleic acid construct, such as plasmid, phage genome, virus genome, cosmid, or artificial chromosome to which another DNA segment may be attached. In a specific embodiment, the vector may bring about the replication of the attached segment, e.g., in the case of a cloning vector. A “replicon” is any genetic element (e.g., plasmid, chromosome, virus) that functions as an autonomous unit of DNA replication in vivo, i.e., it is capable of replication under its own control.

[0041] A “cassette” refers to a segment of DNA that can be inserted into a vector at specific restriction sites. The segment of DNA that can be inserted encodes a polypeptide of interest, and the cassette and restriction sites are designed to ensure insertion of the cassette in the proper reading frame for transcription and translation.

[0042] A cell has been “transfected” by exogenous or heterologous DNA when such DNA has been introduced inside the cell. A cell has been “transformed” by exogenous or heterologous DNA when transfected DNA is expressed.

[0043] The term “heterologous” refers to a combination of elements not naturally occurring. For example, heterologous DNA refers to DNA not naturally located in the cell, or in a chromosomal site of the cell. Preferably, the heterologous DNA includes a gene foreign to the cell. A heterologous expression regulatory element is such an element operatively associated with a different gene than the one it is operatively associated with in nature. For example, a regulatory element of the invention is operatively associated with a heterologous gene when the gene is not a gene encoding human β₃-AR.

[0044] A “nucleic acid molecule” refers to the phosphate ester polymeric form of ribonucleosides (adenosine, guanosine, uridine or cytidine; “RNA molecules”) or deoxyribonucleosides (deoxyadenosine, deoxyguanosine, deoxythymidine, or deoxycytidine; “DNA molecules”), or any phosphoester analogs thereof, such as phosphorothioates and thioesters, in either single stranded form, or a double-stranded helix. Double stranded DNA:DNA, DNA:RNA and RNA:RNA helices are possible. The term nucleic acid molecule, and in particular DNA or RNA molecule, refers only to the primary and secondary structure of the molecule, and does not limit it to any particular tertiary forms. Thus, this term includes double-stranded DNA found, inter alia, in linear or circular DNA molecules (e.g., restriction fragments), plasmids, and chromosomes. In discussing the structure of particular double-stranded DNA molecules, sequences may be described herein according to the normal convention of giving only the sequence in the 5′ to 3′ direction along the nontranscribed strand of DNA (i.e., the strand having a sequence homologous to the mRNA). A “recombinant DNA molecule” is a DNA molecule that has undergone a molecular biological manipulation.

[0045] A “gene” is used herein to refer to a portion of a DNA molecule that includes a polypeptide coding sequence operatively associated with expression control sequences. In one embodiment, a gene can be a genomic or partial genomic sequence, in that it contains one or more introns. In another embodiment, the term gene refers to a cDNA molecule (i.e., the coding sequence lacking any introns).

[0046] A DNA “coding sequence” is a double-stranded DNA sequence which is transcribed and translated into a polypeptide in a cell in vitro or in vivo when placed under the control of appropriate regulatory sequences. The boundaries of the coding sequence are determined by a start codon at the 5′ (amino) terminus and a translation stop codon at the 3′ (carboxyl) terminus. A coding sequence can include, but is not limited to, prokaryotic sequences, cDNA from eukaryotic niRNA, genomic DNA sequences from eukaryotic (e.g., mammalian) DNA, and even synthetic DNA sequences. If the coding sequence is intended for expression in a eukaryotic cell, a polyadenylation signal and transcription termination sequence will usually be located 3′ to the coding sequence.

[0047] “Expression control sequences”, e.g., transcriptional and translational control sequences, are regulatory sequences that flank a coding sequence, such as promoters, enhancers, suppressors, terminators, and the like, that provide for the expression of a coding sequence in a host cell. In eukaryotic cells, polyadenylation signals are control sequences. On mRNA, a ribosome binding site is an expression control sequence.

[0048] A coding sequence is “operatively associated with” or “under the control” of transcriptional and translational control sequences in a cell when RNA polymerase transcribes the coding sequence into MRNA, which is then trans-RNA spliced and translated into the protein encoded by the coding sequence.

[0049] A “promoter sequence” is a DNA regulatory region capable of binding RNA polymerase in a cell and initiating transcription of a downstream (3′ direction) coding sequence. For purposes of defining the present invention, the promoter sequence is bounded at its 3′ terminus by the transcription initiation site and extends upstream (5′ direction) to include the minimum number of bases or elements necessary to initiate transcription at levels detectable above background. Within the promoter sequence will be found a transcription initiation site (conveniently defined for example, by mapping with nuclease S1), as well as protein binding domains (consensus sequences) responsible for the binding of RNA polymerase. In a specific embodiment, the β₃-AR promoter extends about 0.5 Kb 5′ to the translation start site.

[0050] A nucleic acid molecule is “hybridizable” to another nucleic acid molecule, such as a cDNA, genomic DNA, or RNA, when a single stranded form of the nucleic acid molecule can anneal to the other nucleic acid molecule under the appropriate conditions of temperature and solution ionic strength (see Sambrook et al., supra). The conditions of temperature and ionic strength determine the “stringency” of the hybridization. For preliminary screening for homologous nucleic acids, low stringency hybridization conditions, corresponding to a T_(m) of 55° C., can be used, e.g., 5×SSC, 0.1% SDS, 0.25% milk, and no formamide; or 30% formamide, 5×SSC, 0.5% SDS. Moderate stringency hybridization conditions correspond to a higher T_(m), e.g., 40% formamide, with 5× or 6×SSC. High stringency hybridization conditions correspond to the highest T_(m), e.g., 50% formamide, 0.1×SSC. Hybridization requires that the two nucleic acids contain complementary sequences, although depending on the stringency of the hybridization, mismatches between bases are possible. The appropriate stringency for hybridizing nucleic acids depends on the length of the nucleic acids and the degree of complementation, variables well known in the art. The greater the degree of similarity or homology between two nucleotide sequences, the greater the value of T_(m) for hybrids of nucleic acids having those sequences. The relative stability (corresponding to higher T_(m)) of nucleic acid hybridizations decreases in the following order: RNA:RNA, DNA:RNA, DNA:DNA. For hybrids of greater than 100 nucleotides in length, equations for calculating T_(m) have been derived (see Sambrook et al., supra, 9.50-0.51). For hybridization with shorter nucleic acids, i.e., oligonucleotides, the position of mismatches becomes more important, and the length of the oligonucleotide determines its specificity (see Sambrook et al., supra, 11.7-11.8). A minimum length for a hybridizable nucleic acid is at least about 10 nucleotides; preferably at least about 15 nucleotides; and more preferably the length is at least about 20 nucleotides.

[0051] In a specific embodiment, the term “standard hybridization conditions” refers to a T_(m) of 55 ° C., and utilizes conditions as set forth above. In a preferred embodiment, the T_(m) is 60° C.; in a more preferred embodiment, the T_(m) is 65 ° C. In a specific embodiment, “high stringency” refers to hybridization and/or washing conditions at 68° C. in 0.2×SSC, at 42° C. in 50% formamide, 4×SSC, or under conditions that afford levels of hybridization equivalent to those observed under either of these two conditions.

[0052] As used herein, the term “oligonucleotide” refers to a nucleic acid, generally of at least 10, preferably at least about 15, and more preferably at least about 20 nucleotides, that is hybridizable to a genomic DNA molecule, a cDNA molecule, or an mRNA molecule encoding a regulatory or protein binding segment, a gene, mRNA, cDNA, or other nucleic acid of interest. Oligonucleotides can be labeled, e.g., with ³²P-nucleotides or nucleotides to which a label, such as biotin, has been covalently conjugated. In one embodiment, a labeled oligonucleotide can be used as a probe to detect the presence of a nucleic acid. In another embodiment, oligonucleotides (one or both of which may be labeled) can be used as PCR primers, either for cloning full length or a fragment of the enhancer element, or to detect the presence of nucleic acids encoding the enhancer element. In a further embodiment, an oligonucleotide of the invention can form a triple helix with a DNA molecule containing one or more of the segments found in the enhancer element, e.g., to suppress positive regulation of the enhancer. Generally, oligonucleotides are prepared synthetically, preferably on a nucleic acid synthesizer. Accordingly, oligonucleotides can be prepared with non-naturally occurring phosphoester analog bonds, such as thioester bonds, etc.

[0053] “Homologous recombination” refers to the insertion of a foreign DNA sequence of a vector in a chromosome. Preferably, the vector targets a specific chromosomal site for homologous recombination. For specific homologous recombination, the vector will contain sufficiently long regions of homology to sequences of the chromosome to allow complementary binding and incorporation of the vector into the chromosome. Longer regions of homology, and greater degrees of sequence similarity, may increase the efficiency of homologous recombination.

[0054] As used herein, the term “homologous” in all its grammatical forms and spelling variations refers to the relationship between proteins that possess a “common evolutionary origin,” including proteins from superfamilies (e.g., the immunoglobulin superfamily) and homologous proteins from different species (e.g., myosin light chain, etc.) (Reeck et al., Cell 50:667, 1987). Such proteins (and their encoding genes) have sequence homology, as reflected by their high degree of sequence similarity.

[0055] Accordingly, the term “sequence similarity” in all its grammatical forms refers to the degree of identity or correspondence between nucleic acid or amino acid sequences of proteins that may or may not share a common evolutionary origin (see Reeck et al., supra). However, in common usage and in the instant application, the term “homologous,” particularly when modified with an adverb such as “highly,” may refer to sequence similarity, which may or may not relate to a common evolutionary origin.

[0056] In a specific embodiment, two DNA sequences are “substantially homologous” or “substantially similar” when at least about 80% (preferably at least about 90%) of the nucleotides match over the defined length of the DNA sequences. Sequences that are substantially homologous can be identified by comparing the sequences using standard software available in sequence data banks, or in a Southern hybridization experiment under, for example, stringent conditions as defined for that particular system. Defining appropriate hybridization conditions is within the skill of the art. See, e.g., Maniatis et al., supra; DNA Cloning, Vols. I & II, supra; Nucleic Acid Hybridization, supra.

[0057] The term “corresponding to” is used herein to refer to similar or homologous sequences, whether the exact position is identical or different from the molecule to which the similarity or homology is measured. A nucleic acid or amino acid sequence alignment may include spaces. Thus, the term “corresponding to” refers to the sequence similarity, and not the numbering of the amino acid residues or nucleotide bases.

[0058] A nucleic acid comprising one or more of segments A, B, and C, including an enhancer element (comprising all three segments in proximity), can be isolated from any source, particularly from a human genomic library. Methods for obtaining such nucleic acids are well known in the art, as described above (see, e.g., Sambrook et al., 1989, supra). Accordingly, any human cell potentially can serve as the nucleic acid source for the molecular cloning of a regulatory element. The DNA may be obtained by standard procedures known in the art from cloned DNA (e.g., a DNA “library”).

[0059] Once the DNA fragments are generated, identification of the specific DNA fragment containing the desired regulatory activity may be accomplished in a number of ways. For example, as shown in the Examples, regulation of expression of a reporter gene, such as luciferase, can establish that the regulatory element or elements have been obtained. Alternatively, oligonucleotide probes or primers can be used to detect the presence of a nucleic acid encoding the regulatory region. As noted above, the greater the degree of sequence similarity, the more stringent hybridization conditions can be used.

[0060] The present invention also relates to cloning or using recombinant means to prepare vectors containing genes encoding analogs and derivatives of the regulatory segments of the invention, that have the same or homologous functional activity as the segments, and homologs thereof from other species. Also contemplated, and specifically exemplified herein, are derivatives or analogs of the regulatory segments that do not bind to the specific proteins. Such “non-functional” derivatives or analogs can be used to evaluate the characteristics of the DNA binding proteins that recognize the regulatory segments of the invention. The production and use of derivatives and analogs are within the scope of the present invention.

[0061] Nucleic acids encoding derivatives and analogs of the invention can be produced by various methods known in the art. The manipulations which result in their production can occur at the gene or protein level. For example, the cloned regulatory segment can be modified by any of numerous strategies known in the art (Sambrook et al., 1989, supra). The sequence can be cleaved at appropriate sites with restriction endonuclease(s), followed by further enzymatic modification if desired, isolated, and ligated in vitro. Additionally, the regulatory segment can be mutated in vitro or in vivo, to create and/or destroy translation, initiation, and/or termination sequences, or to create variations in coding regions and/or form new restriction endonuclease sites or destroy preexisting ones, to facilitate further in vitro modification. Any technique for mutagenesis known in the art can be used, including but not limited to, in vitro site-directed mutagenesis (Hutchinson, C., et al., J. Biol. Chem. 253:6551, 1978 ; Zoller and Smith, DNA 3:479-488, 1984; Oliphant et al., Gene 44:177, 1986; Hutchinson et al., Proc. Natl. Acad. Sci. U.S.A. 83:710, 1986), use of TAB™ linkers (Pharmacia), etc. PCR techniques are preferred for site directed mutagenesis (see Higuchi, “Using PCR to Engineer DNA”, in PCR Technology: Principles and Applications for DNA Amplification, H. Erlich, ed., Stockton Press, 1989, Chapter 6, pp. 61-70).

[0062] A regulatory element of the invention can be inserted into an appropriate expression vector, i.e., a vector which contains the necessary elements for the transcription and translation of a protein-coding sequence. Thus, the regulatory element of the invention is operatively associated with a gene in an expression vector of the invention. Both cDNA and genomic sequences can be cloned and expressed under control of such regulatory sequences. An expression vector also preferably includes a replication origin.

[0063] Potential host-vector systems include but are not limited to mammalian cell systems infected with virus (e.g., vaccinia virus, adenovirus, etc.); insect cell systems infected with virus (e.g., baculovirus); microorganisms such as yeast containing yeast vectors; or bacteria transformed with bacteriophage, DNA, plasmid DNA, or cosmid DNA. The expression elements of vectors vary in their strengths and specificities. Depending on the host-vector system utilized, any one of a number of suitable transcription and translation elements may be used.

[0064] A recombinant gene may be expressed chromosomally under control of a regulatory element of the invention after integration of the coding sequence by recombination. In this regard, any of a number of amplification systems may be used to achieve high levels of stable gene expression (See Sambrook et al., 1989, supra).

[0065] Any of the methods for the insertion of DNA fragments into a cloning vector may be used to construct expression vectors. These methods may include in vitro recombinant DNA and synthetic techniques and in vivo recombination (genetic recombination).

[0066] Expression of a protein under control of a regulatory element of the invention may be controlled by any promoter known in the art, so long as the promoter is functional in the host selected for expression. Promoters which may be used to control gene expression include, but are not limited to, the cytomegalovirus immediate early (CMV) promoter, the SV40 early promoter region (Benoist and Chambon, Nature 290:304-310, 1981), the promoter contained in the 3′ long terminal repeat of Rous sarcoma virus (Yamamoto, et al., Cell 22:787-797, 1980), the herpes thymidine kinase promoter (Wagner et al., Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445, 1981), and the regulatory sequences of the metallothionein gene (Brinster et al., Nature 296:39-42, 1982). In specific embodiments, the regulatory element of the invention is operably associated with an HSVtk promoter and with a β₃-AR promoter. In a preferred embodiment, a heterologous gene is expressed under control of a β₃-AR promoter.

[0067] Vectors are introduced into the desired host cells by methods known in the art, e.g., transfection, electroporation, microinjection, transduction, cell fusion, DEAE dextran, calcium phosphate precipitation, lipofection (liposome fusion), use of a gene gun, or a DNA vector transporter (see, e.g., Wu et al., 1992, J. Biol. Chem. 267:963-967; Wu and Wu, 1988, J. Biol. Chem. 263:14621-14624; Hartmut et al., Canadian Patent Application No. 2,012,311, filed March 15, 1990).

Gene Therapy and Transgenic Vectors

[0068] As discussed above, a vector is any means for the transfer of a nucleic acid according to the invention into a host cell. The regulatory elements of the present invention, which permit positive, tissue-specific expression control, are useful in conjunction with delivery of a therapeutic gene in vivo or ex vivo, e.g., gene therapy. Such vectors can be viral vectors, such as retroviruses, herpes viruses, adenoviruses and adeno-associated viruses, or non-viral vectors.

[0069] Viral vectors commonly used for in vivo or ex vivo targeting and therapy procedures are DNA-based vectors and retroviral vectors. Methods for constructing and using viral vectors are known in the art (see, e.g., Miller and Rosman, BioTechniques 7:980-990, 1992). Preferably, the viral vectors are replication defective, that is, they are unable to replicate autonomously in the target cell. In general, the genome of the replication defective viral vectors used within the scope of the present invention lack at least one region which is necessary for the replication of the virus in the infected cell. These regions can either be eliminated (in whole or in part), be rendered non-functional by any technique known to a person skilled in the art. These techniques include the total removal, substitution (by other sequences, in particular by the inserted nucleic acid), partial deletion, or addition of one or more bases to an essential (for replication) region. Such techniques may be performed in vitro (on the isolated DNA) or in situ, using the techniques of genetic manipulation or by treatment with mutagenic agents. Preferably, the replication defective virus retains the sequences of its genome which are necessary for encapsidating the viral particles.

[0070] DNA viral vectors include an attenuated or defective DNA virus, such as but not limited to herpes simplex virus (HSV), papillomavirus, Epstein Barr virus (EBV), adenovirus, adeno-associated virus (AAV), and the like. Defective viruses, which entirely or almost entirely lack viral genes, are preferred. Defective virus is not infective after introduction into a cell. Thus, a specific tissue can be specifically targeted. Examples of particular vectors include, but are not limited to, a defective herpes virus 1 (HSV1) vector (Kaplitt et al., Molec. Cell. Neurosci. 2:320-330, 1991), defective herpes virus vector lacking a glyco-protein L gene (Patent Publication RD 371005 A), or other defective herpes virus vectors (International Patent Publication No. WO 94/21807, published Sep. 29, 1994; International Patent Publication No. WO 92/05263, published Apr. 2, 1994); an attenuated adenovirus vector, such as the vector described by Stratford-Perricaudet et al. (J. Clin. Invest. 90:626-630, 1992; see also La Salle et al., Science 259:988-990, 1993); and a defective adeno-associated virus vector (Samulski et al., J. Virol. 61:3096-3101, 1987; Samulski et al., J. Virol. 63:3822-3828, 1989; Lebkowski et al., Mol. Cell. Biol. 8:3988-3996, 1988).

[0071] Adenovirus vectors. Adenoviruses are eukaryotic DNA viruses that can be modified to efficiently deliver a nucleic acid of the invention to a variety of cell types. Various serotypes of adenovirus exist. Of these serotypes, type 2 or type 5 human adenoviruses (Ad 2 or Ad 5) or adenoviruses of animal origin (see WO94/26914) are most commonly used. Those adenoviruses of animal origin which can be used within the scope of the present invention include adenoviruses of canine, bovine, murine (example: Mavl, Beard et al., Virology 75 (1990) 81), ovine, porcine, avian, and simian (example: SAV) origin. Preferably, the adenovirus of animal origin is a canine adenovirus, more preferably a CAV2 adenovirus (e.g. Manhattan or A26/61 strain (ATCC VR-800), for example).

[0072] Preferably, the replication defective adenoviral vectors of the invention comprise the inverted terminal repeats (ITRs), an encapsidation sequence and the nucleic acid of interest. Still more preferably, at least the E1 region of the adenoviral vector is non-functional. The deletion in the E1 region preferably extends from nucleotides 455 to 3329 in the sequence of the Ad5 adenovirus (PvuII-BgIII fragment) or 382 to 3446 (HinfII-Sau3A fragment). Other regions may also be modified, in particular the E3 region (WO95/02697), the E2 region (WO94/28938), the E4 region (WO94/28152, WO94/12649, WO95/02697 and WO96/22378), or in any of the late genes L1-L5.

[0073] The replication defective recombinant adenoviruses according to the invention can be prepared by any technique known to the person skilled in the art (Levrero et al., Gene 101 (1991) 195, EP 185 573; Graham, EMBO J. 3 (1984) 2917). In particular, they can be prepared by homologous recombination between an adenovirus and a plasmid which carries, inter alia, the DNA sequence of interest. The homologous recombination is effected following cotransfection of the adenovirus and plasmid into an appropriate cell line. The cell line which is employed should preferably (i) be transformable by the elements, and (ii) contain the sequences which are able to complement the part of the genome of the replication defective adenovirus, preferably in integrated form in order to avoid the risks of recombination. Examples of cell lines which may be used are the human embryonic kidney cell line 293 (Graham et al., J. Gen. Virol. 36 (1977) 59) which contains the left-hand portion of the genome of an Ad5 adenovirus (12%) integrated into its genome, and cell lines which are able to complement the E1 and E4 functions, as described in applications WO94/26914 and WO95/02697. Recombinant adenoviruses are recovered and purified using standard molecular biological techniques, which are well known to one of ordinary skill in the art.

[0074] Adeno-associated viruses. The adeno-associated viruses (AAV) are DNA viruses of relatively small size which can integrate, in a stable and site-specific manner, into the genome of the cells which they infect. They are able to infect a wide spectrum of cells without inducing any effects on cellular growth, morphology or differentiation, and they do not appear to be involved in human pathologies. The AAV genome has been cloned, sequenced and characterized. It encompasses approximately 4700 bases and contains an inverted terminal repeat (ITR) region of approximately 145 bases at each end, which serves as an origin of replication for the virus. The remainder of the genome is divided into two essential regions which carry the encapsidation functions: the left-hand part of the genome, which contains the rep gene involved in viral replication and expression of the viral genes; and the right-hand part of the genome, which contains the cap gene encoding the capsid proteins of the virus. The use of vectors derived from the AAVs for transferring genes in vitro and in vivo has been described (see WO 91/18088; WO 93/09239; U.S. Pat. Nos. 4,797,368, 5,139,941, EP 488 528).

[0075] Retrovirus vectors. In another embodiment the gene can be introduced in a retroviral vector, e.g., as described in Anderson et al., U.S. Pat. No. 5,399,346; Mann et al., 1983, Cell 33:153; Temin et al., U.S. Pat. No. 4,650,764; Temin et al., U.S. Pat. No. 4,980,289; Markowitz et al., 1988, J. Virol. 62:1120; Temin et al., U.S. Pat. No. 5,124,263; EP 453242, EP178220; Bernstein et al. Genet. Eng. 7:235, 1985; McCormick, BioTechnology 3:689, 1985; International Patent Publication No. WO 95/07358, published Mar. 16, 1995, by Dougherty et al.; and Kuo et al., 1993, Blood 82:845. These vectors can be constructed from different types of retrovirus, such as, HIV, MoMuLV (“murine Moloney leukaemia virus” MSV (“murine Moloney sarcoma virus”), HaSV (“Harvey sarcoma virus”); SNV (“spleen necrosis virus”); RSV (“Rous sarcoma virus”) and Friend virus. Defective retroviral vectors are disclosed in WO95/02697. Packaging cell lines for preparation of retroviral vectors have been described in the prior art, in particular the cell line PA317 (U.S. Pat. No. 4,861,719); the PsiCRIP cell line (WO90/02806) and the GP+envAm-12 cell line (WO89/07150). In addition, the recombinant retroviral vectors can contain modifications within the long terminal repeats (LTRs) for suppressing transcriptional activity as well as extensive encapsidation sequences which may include a part of the gag gene (Bender et al., J. Virol. 61:1639, 1987). Recombinant retroviral vectors are purified by standard techniques known to those having ordinary skill in the art.

[0076] Non-viral vectors. Alternatively, the vector can be introduced in vivo as naked DNA, or with a DNA transfer facilitating agent, such as a lipid.

[0077] For example, one method for transfer of a nucleic acid vector is by lipofection. Synthetic cationic lipids designed to limit the difficulties and dangers encountered with liposome mediated transfection can be used to prepare liposomes for in vivo transfection of a gene encoding a marker (Felgner, et. al., Proc. Natl. Acad. Sci. U.S.A. 84:7413-7417, 1987; see Mackey, et al., Proc. Natl. Acad. Sci. U.S.A. 85:8027-8031, 1988; Ulmer et al., Science 259:1745-1748, 1993). The use of cationic lipids may promote encapsulation of negatively charged nucleic acids, and also promote fusion with negatively charged cell membranes (Felgner and Ringold, Science 337:387-388, 1989). Particularly useful lipid compounds and compositions for transfer of nucleic acids are described in International Patent Publications WO95/18863 and WO96/17823, and in U.S. Pat. No. 5,459,127. The use of lipofection to introduce exogenous genes into the specific organs in vivo has certain practical advantages. Molecular targeting of liposomes to specific cells represents one area of benefit. It is clear that directing transfection to particular cell types would be particularly advantageous in a tissue with cellular heterogeneity, such as pancreas, liver, kidney, and the brain. Lipids may be chemically coupled to other molecules for the purpose of targeting (see Mackey, et. al., supra). Targeted peptides, e.g., hormones or neurotransmitters, and proteins such as antibodies, or non-peptide molecules could be coupled to liposomes chemically.

[0078] Other molecules are also useful for facilitating transfection of a nucleic acid in vivo, such as a cationic oligopeptide (e.g., International Patent Publication WO95/21931), peptides derived from DNA binding proteins (e.g., International Patent Publication WO96/25508), or a cationic polymer (e.g., International Patent Publication WO95/2193 1).

[0079] It is also possible to introduce the vector in vivo as a naked DNA plasmid. Naked DNA vectors for gene therapy can be introduced into the desired host cells by methods known in the art, e.g., transfection, electroporation, microinjection, transduction, cell fusion, DEAE dextran, calcium phosphate precipitation, use of a gene gun, or use of a DNA vector transporter (see, e.g., Wu et al., J. Biol. Chem. 267:963-967, 1992; Wu and Wu, J. Biol. Chem. 263:14621-14624, 1988; Hartmut et al., Canadian Patent Application No. 2,012,311, filed Mar. 15, 1990; Williams et al., Proc. Natl. Acad. Sci. USA 88:2726-2730, 1991). Receptor-mediated DNA delivery approaches can also be used (Curiel et al., Hum. Gene Ther. 3:147-154, 1992; Wu and Wu, J. Biol. Chem. 262:4429-4432, 1987).

Screening Assays

[0080] Identification and isolation of the regulatory elements of the invention provides for development of screening assays, particularly for high throughput screening of molecules that up- or down-regulate the activity of the regulatory element of the invention. In one embodiment, control of the regulatory element can be effected by increasing or decreasing the activity of a trans-acting factor (preferably the B-segment-binding trans-acting factor) of the invention. Alternatively, the factor can act directly by interacting with the sequences of the regulatory segments or region.

[0081] Molecules that increase the activity of the regulatory element of the invention, and thus increase the level of expression of endogenous β₃-AR, would be useful in combination with a β₃-AR-specific agonist for the treatment of obesity and diabetes, particularly non-insulin dependent diabetes mellitus (NIDDM). In particular, an agent that induces expression of the B-segment-specific trans-acting factor discovered herein would be useful for inducing β₃-AR expression in a desired tissue, such as a WAT cell or an undifferentiated adipocyte (so as to push it to differentiation into a BAT cell), so as to render it sensitive to a β₃-AR agonist. In addition, such agents may be useful in combination with a β₃-AR agonist for treating urinary incontinence and to suppress ureteral colic; in the facilitation of stone discharge in urolithiasis patients; and in the treatment of depression (at least one β₃-AR agonist having been shown to have anti-depressant activity).

[0082] Any mammalian cell can be used to screen for molecules that increase or decrease the activity of a β₃-AR trans-activating factor. Preferably the cells express or have the potential to express the β₃-AR trans-activating factor, such as BAT (including HIB) or SK-N-MC cells. BAT and SK-N-MC cells have been discussed. The HIB 1B cell line was derived from brown adipose tissue (BAT) tumor isolated from transgenic mice (Ross et al., PNAS, 89:7561-7565, 1992). This cell line represents the first established brown adipocytes cell line capable of expressing uncoupling protein 1 (UCP-1). UCP-1 is a mitochondrial protein expressed exclusively in BAT. UCP-1 mRNA is detectable after stimulation of β-adrenergic receptor system that is present in these cells (β₁-and β₂-AR, 60-70%, and β₃-AR, 30-40%) (Klaus et al., J. Cell. Sci., 107:313-319, 1994). The HIB 1B cells are maintained in DMEM:F12 (1:1) in the presence of 10% FCS. When they reach confluency, cells are induced to differentiate (i.e., to show UCP-1 expression and accumulation of lipid droplets). Differentiation is achieved by adding 5% FBS in addition to T₃ and insulin, to the medium. Seven days after confluency in the presence of T₃ and insulin, 90-95% of the cells are differentiated.

[0083] Any screening technique known in the art can be used to screen for compounds that up- or down-regulate the activity of the regulatory region of the invention. As used herein, the term “compound” refers to any molecule or complex of more than one molecule that affects the regulatory region. The present invention contemplates screens for synthetic small molecule agents, chemical compounds, chemical complexes, and salts thereof as well as screens for natural products, such as plant extracts or materials obtained from fermentation broths. Other molecules that can be identified using the screens of the invention include proteins and peptide fragments, peptides, nucleic acids and oligonucleotides (particularly triple-helix-forming oligonucleotides), carbohydrates, phospholipids and other lipid derivatives, steroids and steroid derivatives, prostaglandins and related arachadonic acid derivatives, etc.

[0084] Knowledge of the primary sequence of the regulatory region, and particularly the regulatory segments, permits development of DNA binding molecules, including triple-helix forming oligonucleotides, that can be used to interfere with transcription factor binding to the regulatory region, and thus inhibit or block positive gene regulation mediated by the region.

[0085] Various approaches can be used to identify small molecules for testing. One approach uses primarily chemical methods, of which the Geysen method (Geysen et al., Molecular Immunology 23:709-715, 1986; Geysen et al., J. Immunologic Method 102:259-274, 1987) and the method of Fodor et al. Science 251:767-773, 1991) are examples. Furka et al. (14th International Congress of Biochemistry, Volume 5, Abstract FR:013, 1988; Furka, Int. J. Peptide Protein Res. 37:487-493, 1991), Houghton (U.S. Pat. No. 4,631,211, issued Dec. 1986) and Rutter et al. (U.S. Pat. No. 5,010,175, issued Apr. 23, 1991) describe methods to produce a mixture of peptides that can be tested as agonists or antagonists.

[0086] In another aspect, synthetic libraries (Needels et al., Proc. Natl. Acad. Sci. USA 90:10700-4, 1993); Ohlmeyer et al., Proc. Natl. Acad. Sci. USA 90:10922-10926, 1993); Lam et al., International Patent Publication No. WO 92/00252; Kocis et al., International Patent Publication No. WO 9428028) and the like can be used to screen for agents according to the present invention.

Reporter Gene Assays

[0087] The screening can be performed with recombinant cells that express a reporter gene under control of the regulatory region of the invention, particularly the enhancer region. For example, in an example, infra, luciferase (a reporter gene) is placed under control of the enhancer region for detecting enhancement of reporter gene expression.

[0088] In one embodiment, a reporter gene assay can be used to detect increased expression of a gene under control of the β₃-AR enhancer element in a cell that does not normally express or expresses at very low levels β₃-AR relative to β₃-AR-expressing cells. Such cells include WAT cells (including perirenal cells), muscle cells, liver cells and HeLa and CV-1 cells. In a specific embodiment, mouse WAT, muscle, and liver cells did not express the B segment trans-activating factor. These cells, recombinantly engineered with a reporter gene under control of the enhancer region, can be used to screen for molecules that induce trans-activating factor activity.

[0089] In another embodiment, increased levels of expression of the B segment binding trans-activating factor can be detected in cells that express β₃-AR, including but not limited to BAT cells (including HIB cells, which is a BAT-derived cell line), and human neuroblastoma cells. In both cases, an increase in the level of expression of the reporter gene, relative to the level of expression in the absence of a test compound, indicates that the test compound has a positive influence on regulation of expression under control of the regulatory element of the invention.

[0090] Alternatively, a reporter gene assay can be used to test inhibitors of the regulatory region. Such reporter gene assays are most conveniently performed on cells that constitutively express high levels of a reporter gene operatively associated with the regulatory element. A reduction in the level of expression of the reporter gene in the presence of a test compound, relative to the level of expression in the absence of the test compound, indicates that the test compound inhibits expression control by the regulatory element.

[0091] Reporter genes for use in the invention encode detectable proteins, including, but are by no means limited to, chloramphenicol transferase (CAT), β-galactosidase (β-gal), luciferase, green fluorescent protein, alkaline phosphatase, and other genes that can be detected, e.g., immunologically (by antibody assay).

Direct Assays for B Segment Binding Trans-Activating Factor Activity

[0092] As an alternative or an adjunct to the reporter gene assays described above, the present invention permits directly assessing the level of expression of the B segment binding trans-activating protein by detecting the amount of this protein associated with a nucleic acid containing the B segment core binding sequence. Such assays include gel shift assays, solid phase binding assays, and the like.

[0093] For such assays, preferably either the nucleic acid (such as an oligonucleotide) containing the B segment sequence, or the protein, or both, are detectably labeled so that any binding of the two can be detected. Such labels include enzymes, such as alkaline phosphatase and horseradish peroxidase; colored latex beads; magnetic beads; fluorescent labels, e.g., fluorescein isothiocyanate (FITC), phycoerythrin (PE), Texas red (TR), rhodamine, free or chelated lanthanide series salts, especially Eu³⁺, to name a few fluorophores; fluorescence transfer pairs; chemiluminescent molecules; radioisotopes; or magnetic resonance imaging labels. In a specific embodiment, the oligonucleotide is radiolabeled.

Isolation and Characterization of the B Segment Trans-Activating Factor

[0094] A significant and unexpected discovery of the present invention is the identification and characterization of a novel AP-2-like transcription factor, the B-segment binding trans-activating factor. This factor binds to DNA having the sequence GCCTCTGGGGAG (SEQ ID NO:1), and is expressed, inter alia, by mouse brown adipose tissue cells and the human neuroblastoma cell line SK-N-MC; it is also expressed at very low levels in perirenal white adipose tissue (and perhaps not at all in pure white adipose tissue cells), and an AP-2 oligonucleotide does not compete with the B segment sequence for binding to this factor. Furthermore, the factor is not recognized by an anti-AP-2 antibody. Although the presence and characteristics that permit unambiguous identification of this factor are set forth herein, the invention advantageously permits further evaluation and characterization, including elucidation of the sequence of the gene encoding the factor, and accordingly deduction of the complete amino acid sequence.

[0095] The transcription factor AP-2 was first isolated from HeLa cells by affinity chromatography and subsequently cloned by screening a HeLa genomic library. AP-2 is a 52 kDa protein which gene was mapped to a region on chromosome 6. The DNA binding domain within AP-2 transcription factor is located in C-terminal half of protein and consists of two putative amphipathic alpha helices separated by a large spanning region. The N-terminal domain of AP-2 protein contains a trans-activation domain with a proline-rich region. AP-2 transcription factor binds to a consensus binding site −5′-GCCNNNGGC-3′ (SEQ ID NO:4) that is found in numerous viral and cellular promoters. Activity of AP-2 is regulated by different agents. Phorbol-ester and agents that lead to an increase of cAMP induces AP-2 activity independently of protein synthesis (Buttner et al., Mol. Cell. Biology, 13:4174-4185, 1993; Bauer et al., Nucleic Acid Research, 22:1413-1420, 1994.)

[0096] Various means can be used to isolate the B segment binding factor protein or gene, including a yeast one-hybrid assay in a system recombinantly engineered to express polypeptides from cells that express the β₃-AR. Such cells include BAT cells (including HIB cells), and human neuroblastoma cells, such as SK-N-MC cells. The yeast one hybrid system was described by Fields and Song (Nature, 340:245-247, 1989). Numerous laboratories have used this system to successfully clone unknown transcription factors that bind for identified DNA binding sites (e.g., Wu et al., EMBO J., 13:4823-4830, 1994). The yeast-one hybrid system provides the tool for isolation of novel genes that encode the proteins that bind to a target: cis-acting regulatory DNA elements. The one-hybrid assay is based on the interaction between a target-specific DNA-binding domain and a target-independent activation domain (AD). The binding domain for target DNA is provided in a construct with the AD. If the binding protein that recognizes the target DNA is present in the cDNA library, its expression will allow binding to target DNA. Binding to the target DNA results in expression and binding of AD to transcriptional machinery of another vector to induce expression of a reporter gene.

[0097] Three copies of oligonucleotides that correspond to region B are synthesized, annealed and cloned into the Eco RI and the Mlu I sites of pHISi, pHISI-1 and pLacZi vectors (Clontech). B/pHISi and B/pHISi-1 clones are then stably transfected into the genome of yeast strain YM4271. The yeast cells are transformed according to established protocol using LiAc method and colonies selected by growth on SD-His plates. To confirm that the clones have stably integrated B/pHISi or B/pHISi-1 plasmids, colonies are picked and lysed, and PCR is performed using primers on each side of the insert flanking the NcoI site. PCR products that contain the proper size insert are considered positive. The B/pHISi/YM4271 stably integrated yeast cells are transformed with a mouse brown adipose tissue (BAT) library constructed into a pGAD10 vector (custom-made library by Clontech). Colonies are selected by growth on SD/-his/-leu plates supplemented with 30 mM 3-AT (3-amino-1,2,4-triazole). Larger colonies which grow up are streaked onto grid plates and a β-galactosidase assay is performed by colony lifts according to the manufacturer's protocol. PCR is performed on lysed yeast colonies using primers flanking the insert region of the pGAD10 vector. PCR products containing inserts are ligated using T-A cloning system into pCRII vector. Colonies that contain inserts are sequenced and subjected to BLAST search.

[0098] Screening of an expression library generated from SK-N-MC poly-A RNA with a radiolabeled segment “B” oligonucleotide is another strategy for cloning and isolation of the transcription factor that binds for “B” sequence.

[0099] Alternatively, a nucleic acid, and particularly a DNA oligonucleotide, comprising the B segment core binding sequence GCCTCTGGGGAG (SEQ ID NO:1) can be used to isolate the protein, e.g., by an affinity chromatography technique. Alternatively, such an oligonucleotide can be used to capture polysomes expressing the B segment binding factor, permitting isolation and reverse cloning of the B segment binding factor mRNA.

[0100] The present invention will be better understood by reference to the following Example.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

[0101] The Examples in this application are intended to be exemplary thereof, and are not intended to be limiting.

EXAMPLE 1 Isolation and Characterization of a β₃-AR Enhancer

[0102] This example presents data that show the existence of hβ₃-AR cis-acting positive regulatory elements between −6.5 kb to−6.3 kb of the 5′-flanking region. These positive regulatory elements consist of three DNA binding sequences: A, B, and C that act synergistically. Region A binds Sp1 binding factor, region B is recognized by proteins (as yet unidentified) present in nuclear extracts from SK-N-MC cells as well as mouse BAT, and region C is represented by 20 repeats of a CCTT motif that is necessary to achieve full transcriptional activity of hβ₃-AR enhancer.

Materials and Methods

[0103] Cloning of human β₃-AR genomic DNA. In order to isolate a 5′-flanking region of the hβ₃-AR gene, a Human Fibroblast Genomic Library (Stratagene, LaJolla, Calif.) was screened using a 1.2 kb insert from cDNA for the hβ₃-AR gene. cDNA was constructed by ligating four polymerase chain reaction (PCR) products using the following primers: an ATG-NarI fragment, sense primer 5′ CTTTCCCTACCGCCCCACGCGCGATC 3′ (SEQ ID NO:5) and anti-sense primer 5′ GTGGCGCCCAACGGCCAGTGGCCAGTC 3′ (SEQ ID NO:6); a NarI-AccI fragment, 5′ TTGGCGCTGACTGGCCACTGGCCGTTG 3′ (SEQ ID NO:7) as sense and 5′ GCGCGTAGACGAAGAGCATCACGAG 3′ (SEQ ID NO:8) as anti-sense primer; an AccI-Styl fragment, sense primer 5′ CTCGTGATGCTCTTCGTCTSCGCGC 3′ (SEQ ID NO:9) and anti-sense primer 5′ GTGAAGGTGCCCATGATGAGACCCAAGG 3′ (SEQ ID NO:10) and a Styl-TAG fragment, with sense primer 5′ CCCTGTGCACCTTGGGTCTCATCATGG 3′ (SEQ ID NO:11) and anti-sense primer 5′ CCTCTGCCCCGGTTACCTACCC 3′ (SEQ ID NO:12). The corresponding primer sequences were taken according to Emorine et al. (Science 245:1118-21).

[0104] The four fragments were ligated into a pUC 18 plasmid (Gibco-BRL) and sequenced. Using cDNA as a probe two genomic clones were isolated. In addition a PCR product representing a 1.3 kb hβ₃-AR promoter that was previously published (Liggett et al., J. DNA Sequencing and Mapping 2:61-63, 1991) was used to identify and map 7 kb of the hβ₃-AR gene 5′-flanking region. This 7 kb promoter region was subcloned into pSP 72 (Promega, Madison, Wis.) between PstI and HindIII restriction enzyme sites and mapped extensively. The full length of the promoter was sequenced in both directions using automated sequencing with primers designed to “walk” along the sequence as further sequences were obtained.

[0105] 5′ RACE. To identifythe transcription start site ofthe hβ₃-AR gene, 5′ RACE was performed using SK-N-MC cell total RNA purified using RNAzol B (Tel-Test, Inc., Friendswood, Tex.). Poly A RNA was then purified using an Ambion Micropo(A) pure kit. The 5′ RACE was done using the Marathon cDNA amplification kit (Clonetech, Palo Alto, Calif.) according to the protocol provided with the kit. The primers used included: the sense adapter primer AP2 (Clonetech) 5′ ACTCACTATAGGGCTCGAGCGGC 3′ (SEQ ID NO:13) and antisense primer 5′ GGCAGCCCACTGGTGTTGGCGGTAT 3′ (SEQ ID NO:14) that corresponded to the hβ₃-AR gene sequence at positions 729-703 as per the sequence from GenBank with accession # X72861 (positioned 81 base pairs in the 5′-3′ direction from the translation start site). The PCR products were then cloned into a pCRII vector using the TA cloning kit (Invitrogen, Carlsbad, Calif.). The clones were sequenced using the ABI 373 Automated Sequencer.

[0106] Human β₃-AR gene promoter deletion constructs. Serial deletions of the hβ₃-AR gene 5′-flanking region were designed in order to identify the region(s) responsible for transcriptional regulation. A 7 kb sequence was subcloned into the KpnI/HindIII sites of a pGL3 basic vector (Promega) to obtain the full length promoter to drive expression of the reporter gene luciferase labeled here as −7 hβ₃/Luc (KpnI is a part of a multiple cloning site from pSP72) pGL3 basic contains luciferase cDNA as a reporter gene and an upstream synthetic poly-A signal to reduce background. The deletion constructs labeled as −5.5hβ3/Luc, −3hβ3/Luc and 0.5hβ3/Luc were made by digestion with AvrII, EcoRV and BstEII, respectively and KpnI, blunt ended and re-ligated. The construct −.3hβ3/Luc was made by ligating a PCR product corresponding to a 1.3 kb region of the hβ₃-AR gene promoter that has been previously published (Liggett and Schwina, supra). The PCR product was obtained using HeLa cell genomic DNA and the 5′ ggtaccTCTAGGTGGAAAGGTGCATG 3′ (SEQ ID NO:15) as sense and 5′ aagcttAGTCCCCTCCCTGTCGT 3′ (SEQ ID NO:16) as anti-sense primers (GenBank sequence with accession #M62473) was used when choosing the primers.

[0107] Constructs with deletions in the promoter region were made as follows: for the dEVhβ3AR/Luc parental vector −7hβ3AR/Luc was digested with Avr II (−5.6 kb) and EcoRV (−3.1 kb), blunt ended and re-ligated; for dEIhβ3AR/Luc, a deletion between Avr II and EcoRI (−2.3 kb) was made, and dBhβ3AR/Luc was made by digestion of the parental vector with Avr II and BstEII (−0.5 kb); sites of digestion were blunt ended and re-ligated.

[0108] To further analyze and more precisely identify the sequence located between −7 kb and −5.6 kb of the hβ₃-AR promoter that contains cis-regulatory elements, expression plasmids were generated containing PCR products made using a series of primers shown in Table 1. 

What is claimed is:
 1. An isolated β₃-AR trans-activating factor polypeptide, wherein said polypeptide has the following characteristics: (a) it binds specifically to a nucleic acid having the nucleotide sequence GCCTCTGGGGAG (SEQ ID NO: 1); (b) it is expressed by brown adipose tissue cells; (c) it is expressed at very low levels by cells isolated from the perirenal depot; (d) an AP-2 binding nucleic acid does not compete with a nucleic acid comprising a nucleotide sequence GCCTCTGGGGAG (SEQ ID NO: 1) for binding the polypeptide; and, (e) when complexed to a nucleic acid comprising SEQ ID NO: 1, it is not recognized by an antibody to AP-2.
 2. A composition comprising an isolated β₃-AR trans-activating factor polypeptide, wherein said polypeptide has the following characteristics: (a) it binds specifically to a nucleic acid having the nucleotide sequence GCCTCTGGGGAG (SEQ ID NO: 1); (b) it is expressed by brown adipose tissue cells; (c) it is expressed at very low levels by cells isolated from the perirenal depot; (d) an AP-2 binding nucleic acid does not compete with a nucleic acid comprising a nucleotide sequence GCCTCTGGGGAG (SEQ ID NO: 1) for binding the polypeptide; and, (e) when complexed to a nucleic acid comprising SEQ ID NO: 1, it is not recognized by an antibody to AP-2.
 3. The composition of claim 2 further comprising an isolated nucleic acid having the nucleotide sequence GCCTCTGGGGAG (SEQ ID NO: 1). 